PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions
نویسندگان
چکیده
BACKGROUND With the increasing development of biotechnology and informatics technology, publicly available data in chemistry and biology are undergoing explosive growth. Such wealthy information in these data needs to be extracted and transformed to useful knowledge by various data mining methods. Considering the amazing rate at which data are accumulated in chemistry and biology fields, new tools that process and interpret large and complex interaction data are increasingly important. So far, there are no suitable toolkits that can effectively link the chemical and biological space in view of molecular representation. To further explore these complex data, an integrated toolkit for various molecular representation is urgently needed which could be easily integrated with data mining algorithms to start a full data analysis pipeline. RESULTS Herein, the python library PyBioMed is presented, which comprises functionalities for online download for various molecular objects by providing different IDs, the pretreatment of molecular structures, the computation of various molecular descriptors for chemicals, proteins, DNAs and their interactions. PyBioMed is a feature-rich and highly customized python library used for the characterization of various complex chemical and biological molecules and interaction samples. The current version of PyBioMed could calculate 775 chemical descriptors and 19 kinds of chemical fingerprints, 9920 protein descriptors based on protein sequences, more than 6000 DNA descriptors from nucleotide sequences, and interaction descriptors from pairwise samples using three different combining strategies. Several examples and five real-life applications were provided to clearly guide the users how to use PyBioMed as an integral part of data analysis projects. By using PyBioMed, users are able to start a full pipelining from getting molecular data, pretreating molecules, molecular representation to constructing machine learning models conveniently. CONCLUSION PyBioMed provides various user-friendly and highly customized APIs to calculate various features of biological molecules and complex interaction samples conveniently, which aims at building integrated analysis pipelines from data acquisition, data checking, and descriptor calculation to modeling. PyBioMed is freely available at http://projects.scbdd.com/pybiomed.html .
منابع مشابه
BioTriangle: a web-accessible platform for generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions
BACKGROUND More and more evidences from network biology indicate that most cellular components exert their functions through interactions with other cellular components, such as proteins, DNAs, RNAs and small molecules. The rapidly increasing amount of publicly available data in biology and chemistry enables researchers to revisit interaction problems by systematic integration and analysis of h...
متن کاملNew molecular and biochemical records for Mindium laevigata at various developmental stages
It is essential to identify and determine the properties of native plants as natural genetic resources. The present study was performed to identify the Mindium (Michauxi) laevigata species using molecular and biochemical procedures such as genomic DNA extraction, sequencing, and antioxidant capacity and protein content determination at both vegetative and generative phases in various parts of t...
متن کاملIsolation and Characterization of Novel Phage Displayed scFv Fragment for Human Tumor Necrosis Factor Alpha and Molecular Docking Analysis of Their Interactions
Tumor necrosis factor alpha (TNF-α) expression amplifies to excess amounts in several disorders such as rheumatoid arthritis and psoriasis. Although, Anti-TNF biologics have revolutionized the treatment of these autoimmune diseases, formation of anti-drug antibodies (ADA) has dramatically affected their use. The next generation antibodies (e.g. Fab, scFv) have not only reduced resulted immunoge...
متن کاملAnalysis of Molecular Interactions Using the Thermophoresis Method and its Applications in Neuroscience and Biological Processes
Introduction: Molecular interactions play an important role in the phenomenon and biological processes. In fact, any cellular biological process ranged from genetic replication to the production of various proteins to the transmission of neurological, hormonal, membrane involves collections of molecular interactions that occur continuously. Interference in each of these processes at every stage...
متن کاملJAK-STAT pathway and JAK inhibitors: a primer for dermatologists
Background: All cellular events depend upon the DNA synthesis and gene expression involving complex interplay between ligands such as interleukins and interferons, with various cell membrane receptors. These ligand-receptors interactions transmit signals within the cell via numerous signal transduction pathways to affect gene expression. Janus kinase/signal transducer and activator of transcrip...
متن کامل